IsoQuest, Inc.: NetOwlTM Server

نویسنده

  • IsoQuest Inc.
چکیده

1. Introduction NetOwl TM Server is a powerful text analysis software product developed by IsoQuest, Inc. It supports business intelligence by providing fast and easy access to information stored on local Intranets and the global Internet. It organizes, analyzes, and summarizes data extracted by NameTag TM and any Full-Text search engine. It then presents the data for either searching or browsing. NameTag is a data extraction and indexing tool that finds proper names and other defined entities within an input text stream. NetOwl is the total application built on the NameTag core engine. NetOwl tags each desired document or Web page by person, organization, location, relationship and description giving you a browsable "back-of-the-book index.'" This allows for targeting exactly key information that is sought. With NetOwl it is no longer necessary to scroll through massive amounts of text to find exacdy what is useful. NetOwl is a server-based program which generates standard CGI commands that can be executed by any standard Web browser. NetOwl's Loader ingests documents from any text or HTML-based data source, sequentially loads the documents, then passes them on to NameTag and the full-text search engine for data extraction. The full-text search engine data is stored in a proprietary database while NameTag's exlracted data is stored in any ODBC-compliant database. There are two major functional areas of the NetOwl system. The NetOwl Server system consists off • Loader • Full-Text Search • NameTag • Database interface • Client The second NetOwl functional area comprises customization, maintenance and monitoring tools which provide support to the underlying processes of NetOwl: • NetOwl Administrator Tool • NetOwl Service Manager Both major areas use the relational database for information storage and retrieval. This is the only link to each of the functional areas (i.e., all common information is accessed from the central database). The major functional areas never talk direcdy to each other or require or pass information between each other. Loader The Loader component of the NetOwl server collects information to process (by crawling specified sites, USENET news groups and/or locally accessible files) and then creates an index of that data in the NetOwl database. documents from the loader and creates an index of all relevant words. The index thus created can later be searched by the client component to provide a list of relevant documents given a set of keywords. Database The Database component stores the index information generated …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IsoQuest Inc.: Description Of The NetOwl (TM) Extractor System As Used For MUC-7

IsoQuest used its commercial software product, NetOwl Extractor, for the MUC-7 Named Entity task. The product consists of a high-speed C engine that analyzes text based on a configuration file containing a pattern rule base and lexicon. IsoQuest used the NameTag Configuration to recognize proper names and other key phrases in text, and mapped the product’s extraction tags to the MUC-7 NE tags. ...

متن کامل

5-02-30 Evaluating Client/Server Operating Systems: Focus on Windows NT

As organizations increasingly move mainframe-based applications to client/server platforms, Information Systems managers and their staffs face the task of selecting one of several client/server operating environments, including UNIX, Novell Inc.'s NetWare, and Microsoft Corp.'s Windows Microsoft New Technology operating system. The selection process is further complicated because both client an...

متن کامل

Using Server Log Files and Online Experiments to Enhance Internet Marketing

Unlike most traditional media, the Internet is both digital and interactive. Here we do not simply refer to interactions between consumers and a Web site or e-mail, but also between the marketer and the firm’s Web site or e-mail. Furthermore, the digital nature of the Internet records every interaction. These two characteristics — interactivity and digitization — facilitate research possibiliti...

متن کامل

Server Workload Consolidation — Evaluation of Unix Systems Partitioning

© 2002 Giga Information Group, Inc. All rights reserved. Reproduction or redistribution in any form without the prior permission of Giga Information Group is expressly prohibited. This information is provided on an “as is” basis and without express or implied warranties. Although this information is believed to be accurate at the time of publication, Giga Information Group cannot and does not w...

متن کامل

An advanced course in application programming and design

Copy right Idea Grou p Inc . Copy right Idea Grou p Inc . The continuing evolution in state-of-the-art business applications such as those that support e-commerce, advancements in programming language design such as Java™, and the requirements for persistent data access mechanisms have all significantly impacted the required knowledge-base of computer information science graduates. As such thes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997